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Abstract 

Recent studies on microRNA (miRNA) evolution focused mainly on the comparison of miRNA complements between 
animal clades. However, evolution of mi RN As within such groups is poorly explored despite the availability of comparable 
data that in some cases lack only a few key taxa. For flatworms (Platyhelminthes), miRNA complements are available for 
some free-living flatworms and all major parasitic lineages, except for the Monogenea. We present the miRNA comple- 
ment of the monogenean flatworm Gyrodactylus salaris that facilitates a comprehensive analysis of miRNA evolution in 
Platyhelminthes. Using the newly designed bioinformatics pipeline miRCandRef, the miRNA complement was disen- 
tangled from next-generation sequencing of small RNAs and genomic DNA without a priori genome assembly. It consists 
of 39 miRNA hairpin loci of conserved miRNA families, and 22 novel miRNAs. A comparison with the miRNA comple- 
ments of Schmidtea mediterranea (Turbellaria), Schistosoma japonicum (Trematoda), and Echinococcus granulosus 
(Cestoda) reveals a substantial loss of conserved bilaterian, protostomian, and lophotrochozoan miRNAs. Eight of the 
46 expected conserved miRNAs were lost in all flatworms, 16 in Neodermata and 24 conserved miRNAs could not be 
detected in the cestode and the trematode. Such a gradual loss of miRNAs has not been reported before for other animal 
phyla. Currently, little is known about miRNAs in Platyhelminthes, and for the majority of the lost miRNAs there is no 
prediction of function. As suggested earlier they might be related to morphological simplifications. The presence and 
absence of 153 conserved miRNAs was compared for platyhelminths and 32 other metazoan taxa. Phylogenetic analyses 
support the monophyly of Platyhelminthes (Turbellaria + Neodermata [Monogenea {Trematoda + Cestoda}]). 
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Introduction 

MicroRNAs (miRNAs) are single-stranded, approximately 
22-nucleotides long noncoding small RNAs that regulate 
gene expression in animals by posttranscriptional gene regu- 
lation. miRNA families are classified by their target-specific 
seed regions, usually the positions 2-7 of the mature miRNAs. 
The binding of the mature miRNA to the mRNA target site 
can lead to inhibition of the translation and degradation of 
the respective mRNA (for details see Pasquinelli [2012]). In 
animals, miRNAs may play key roles in a broad variety of 
biological processes, such as, for example, cell proliferation 
and metabolism (Brennecke and Cohen 2003), tissue identity 
(Christodoulou et al. 2010), developmental timing (Reinhart 
et al. 2000), cell death (Baehrecke 2003), hematopoiesis (Chen 
et al. 2004), neuron development (Johnston and Hobert 
2003), tumorigenesis (Michael et al. 2003), DNA methylation 
and chromatin modification (Bao et al. 2004), as well as in 
immune defense against viruses (Sarnow et al. 2006). By now, 
more than 19,000 putative miRNAs from animals, plants, 
and viruses have been described and deposited in public 
databases such as miRBase (release 19) (Kozomara and 



Griffiths-Jones 2011). During evolution of most animal line- 
ages, miRNA loci are highly conserved and have been added 
to genomes rather than being lost (Sperling and Peterson 
2009; Wheeler et al. 2009; Tarver et al. 2013). Accordingly, it 
is primarily the gain of miRNA families that is phylogenetically 
informative and has led to a number of phylogenetic studies 
that focus on miRNAs as phylogenetic markers. This seems 
particularly true for major animal lineages such as, for exam- 
ple, Bilateria, Protostomia, and Deuterostomia, where group- 
specific gains of miRNA families could be assigned (Wheeler 
et al. 2009; Christodoulou et al. 2010; Erwin et al. 2011). Only 
to some extent this has been studied in more derived taxo- 
nomic groups (Sperling et al. 2009; Heimberg et al. 2010; 
Campbell et al. 2011; Rota-Stabelli et al. 2011; Sperling et al. 
2011; Wiegmann et al. 2011; Helm et al. 2012), focusing on 
conserved miRNAs or novel miRNAs that were shared be- 
tween certain species. In contrast, if losses of conserved 
miRNAs have been detected, there was usually no correspon- 
dence to any obvious phylogenetic pattern of the missing 
miRNAs (e.g., shared losses). The losses found in, for example, 
acoelan worms, ascidians, and nematodes were mosaic in 
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nature, leaving enough taxon-specific miRNAs for an assign- 
ment to a certain group (for summary see Sperling and 
Peterson [2009] and Philippe et al. [2011]). 

There are two major strategies when using miRNAs as 
phylogenetic markers: A first group of studies use miRNAs 
as additional evidence for testing phylogenetic hypothesis 
inferred with other markers. Frequently, the presence or ab- 
sence of conserved miRNA families is scored as phylogenetic 
information (Tarver et al. 2013). Within Diptera, for example, 
the distribution of miRNAs was shown to be in concordance 
with phylogenomic and morphological results (Wiegmann 
et al. 2011). Interestingly, the presence/absence of miRNA 
loci also supported earlier findings based on the analyses of 
mitochondrial genomes and expressed sequence tags (EST) 
markers that acoelan worms, that is, Symsagittifera roscoffen- 
sis and Hofstenia miamia, do not belong to Platyhelminthes 
(Philippe et al. 2011). The second group of studies is using 
miRNAs as the only phylogenetic marker. Applying this ap- 
proach, the monophyly of Cyclostomes (hagfish and lampreys 
as closest relatives) could be confirmed (Heimberg et al. 
2010). Similarly, Sperling et al. (2009) found miRNA support 
for the monophyly for Annelida, and Helm et al. (2012) con- 
firmed the close relationship of the parasitic and morpholog- 
ically enigmatic Myzostomida and Annelida. However, most 
studies using miRNAs as phylogenetic markers rely on the 
miRNA complement of only one species for each group, as- 
suming the relative constancy of the conserved miRNAs 
within them. This notion is of importance if changes of 
miRNA complements occurred that have rendered the 
focus species different than the remainder of the lineage. In 
particular, losses of conserved miRNA families may occur 
convergently in distantly related groups of organisms. 

Although not being the focus, some recent miRNA-based 
studies have included flatworms (Platyhelminthes), and by in- 
cluding Schmidtea mediterranea demonstrated that miRNAs 
support their phylogenetic affiliation to Lophotrochozoans 
(Erwin et al. 2011; Philippe et al. 2011). A study that included 
miRNA data of 3 flatworms, Schistosoma mansoni, Schi.japo- 
nicum, and S. mediterranea in a presence/absence matrix of 
71 conserved miRNA families over 17 species, however, does 
not support this finding (Helm et al. 201 2). In stark contrast to 
previous studies, they recovered flatworms as paraphyletic 
and basal to all other Bilaterians. Moreover, it became obvious 
that the included flatworms had significant differences in 
their miRNA complements that contradict the common un- 
derstanding of miRNA evolution. This raises questions about 
the phylogeny and the evolution of miRNAs in flatworms. 

Platyhelminthes (Gegenbauer, 1859) include approximately 
20,000 dorsoventrally flattened species. Platyhelminthes lack 
a true coelom and have traditionally been placed together 
with Acoela at the root of Bilateria. Today, however, 
Platyhelminthes are considered Protostomia (Grobben, 1908), 
either within the Lophotrochozoa (Halanych et al., 1995) or the 
Platyzoa (Cavalier-Smith, 1998). Acoela are an independent 
clade that is most likely basal to all bilaterians (Wallberg 
et al. 2007; Hejnol et al. 2009; Jondelius et al. 2011), or part 
of deuterostomes (Philippe et al. 2011). Platyhelminthes in- 
clude the polyphyletic free-living Turbellaria and the 



monophyletic and strictly parasitic Neodermata Cavalier- 
Smith (1998) that include almost 75% of all known flatworm 
species (Littlewood 2006). The divergence time of 
Neodermata and Turbellaria is difficult to assess because 
their fossil record is rather poor (Poinar 2003). It might 
reach 300 My back into the Permian, but recent molecular 
studies suggest an even older split some 510 Ma (Perkins 
2010), which roughly coincides with the occurrence of the 
early vertebrates (Peterson and Butterfield 2005). Neodermata 
are strictly dependent on vertebrate hosts and consist of the 
endoparasitic tapeworms (Cestoda [Rudolph, 1819]; ~ 1,000 
species, e.g., Echinococcus, Taenia, and Hymenolepis), the 
endoparasitic flukes (Trematoda [Rudolphi, 1808]; ~ 18,000 
species, e.g., Clonorchis, Schistosoma, and Fasciola), and the 
ectoparasitic Monogenea (Carus, 1863) (e.g., Gyrodactylus). 
Within Neodermata, the hermaphroditic Monogenea have 
the simplest life cycle with no intermediate hosts. They — in 
contrast to tapeworms and flukes — do not cause any human 
health issues, but some species are severe threats for aquacul- 
ture and fishery. In particular, the genus Gyrodactylus 
v. Nordmann (1832) caught public attention after 
Gyrodactylus salaris (Malmberg, 1957) was first reported as a 
pest of Atlantic salmon (Salmo salar) in Norway in the 1970s 
(Bakke et al. 2007). Most gyrodactylids are viviparous and 
highly progenetic ectoparasites, and it is expected that literally 
all teleost fishes are hosts for at least one Gyrodactylus species. 
Despite the great variety of forms, the monophyly of the 
Neodermata is strongly supported by the name-giving 
Neodermis (Littlewood 2006), but the phylogenetic relation- 
ships of the three main lineages within Neodermata have 
never been resolved unambiguously (Baguna and Riutort 
2004). Historically, the Monogenea have been considered 
the sister group to Cestoda (Rohde 1994; Littlewood et al. 
1999). In contrast, Perkins et al. (2010) found the 
Monogenea basal to a Trematoda + Cestoda clade when an- 
alyzing 32 platyhelminth mitochondrial genomes. Their re- 
sults supported earlier findings based on sequence analyses 
of ribosomal DNA (Lockyer et al. 2003), as well as individual 
mitochondrial genes (Park et al. 2007). However, phylogenetic 
trees based on sequences from a single molecular marker or 
solely on mitochondrial DNA might be inappropriate (Hurst 
and Jiggins 2005; Galtier et al. 2009). Direct and indirect selec- 
tion on the mitochondrial genome as well as its maternal 
mode of inheritance might confound the inference of evolu- 
tionary history (Hurst and Jiggins 2005; Galtier et al. 2009). 
Furthermore, Platyhelminthes are also a fast-evolving group 
and contrasting phylogenetic trees were shown to be due to 
long-branch attraction (Lartillot et al. 2007). Novel phyloge- 
netic markers like miRNAs have not yet been used to study 
their phylogeny. 

Until now, platyhelminth miRNA data have been available 
for the planarians S. mediterranea (Palakodeti et al. 2006; Lu 
et al. 2009; Friedlander et al. 2012) and Dugesia japonica (Xu 
et al. 2013), the trematodes Schi. japonicum (Xue et al. 2008; 
Hao et al. 2010; Wang et al. 2010), Schi. mansoni (de Souza 
Gomes et al. 2011), Orientobilharzia turkestanicum (Wang 
et al. 2012), Fasciola gigantica and F. hepatica (Xu et al. 
2012), and Glonorchis sinensis (Xu et al. 2010) as well as 
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the cestodes Echinococcus granulosus, £ multilocularis 
(Cucher et al. 2011), and Taenia saginata (Ai et al. 2012). 
Only the miRNA complements of S. mediterranea, Schi. japo- 
nicum, and £ granulosus — representatives of all major flat- 
worm groups except Monogenea — are based on either high- 
throughput sequencing or cloned libraries of small RNAs, 
along with the respective reference genomes. For all other 
studies, there is either no reference genome available for 
the studied species or no small RNA libraries, which limits 
the significance of their results considerably. The correspond- 
ing miRNA complements are very likely incomplete and thus 
not suitable for comparative analyses. 

This study therefore aimed at providing a miRNA comple- 
ment of the monogenean flatworm C. salaris. With data for at 
least one species for all major flatworm lineages at hand, we 
here analyze miRNA evolution in platyhelminths in more 
general terms. 

New Approaches 

The bioinformatics pipeline miRCandRef (miRNA-candi- 
date- reference) was developed to enable miRNA predictions 
for nongenome organisms like C. salaris. Briefly, miRCandRef 
uses genomic and small RNA data from next-generation se- 
quencing (NGS) and locally assembles genomic reads that 
match miRNA candidates from small RNA-reads into rela- 
tively short genomic contigs (crystal-contigs). MiRCandRef 
delivers a multi-fasta file of assembled and clustered miRNA 
crystal-contigs that can be used as a genomic reference for 
miRNA prediction software rather than a properly assembled 
genome. Using this approach allows for a comparably fast 
miRNA prediction directly from NGS reads and circumvents 
the need for costly and sometimes patchy draft genome- 
assemblies. In its current version, the pipeline includes a mod- 
ified version of miRDeep2 (Friedlander et al. 2012) that can 
handle the miRCandRef-fasta-output files and small RNA 
reads for miRNA prediction. To simplify the usage of the 
pipeline, miRCandRef has been implemented at the latest 
version of the hyperbrowser (http://hyperbrowser.uio.no, 
last accessed September 26, 2013) (Sandve et al. 2013), 
which is based on the Galaxy system (Giardine et al. 2005) 
and allows for a fully automated analysis based on small RNA 
and DNA-reads uploaded by the user. 

Results 

The miRNA Complement of G. salaris 
The miRCandRef analyses of NGS data sets of small RNAs and 
genomic DNA of G. salaris identified initially some 300,000 of 
the roughly 3 Mio unique small RNA NGS reads with perfect 
matches in one of the about 90 Mio genomic DNA reads. All 
genomic DNA subsets, including the respective mate se- 
quences, were assembled into 401,731 contigs, all representing 
miRNA-candidate-loci. Subsequently, this miRNA-candi- 
date reference assembly and the original small RNA data set 
were used in miRDeep2*, a modified version of miRDeepZ All 
predicted miRNA loci were manually checked for appropriate 
structure and offset of mature and star sequences required for 
Dicer processing, and thus all accepted miRNA loci had to be 



represented by at least one mature and one star read (Tarver 
et al. 2012). Accordingly, 39 G. salaris miRNAs (gsa-mir) with 
high sequence similarities to previously described miRNAs 
and 22 miRNAs with yet undescribed seeds were identified 
(see supplementary file S1, Supplementary Material online, for 
read-counts, structures including offsets, and alignments). 
Four highly abundant miRNAs with particularly long hairpins, 
miR-1 (represented with more than 2 Mio reads in the small 
RNA NGS read pool), miR-87 (represented with more than 
300,000 reads), mir-36 (represented with ~57,000 reads), and 
mir-9 (represented with ~8,000 reads) could only be detected 
after changing the miRDeep2 algorithmic parameters for the 
excision of hairpins. 

Loss and Gain of miRNAs in Platyhelminth Flatworms 
Earlier studies (Wheeler et al. 2009; Christodoulou et al. 2010; 
Erwin et al. 2011; Tarver et al. 2013) proposed ancestral 
miRNA complements for several animal lineages, and accord- 
ingly 46 conserved miRNAs families were expected in 
Gyrodactylus, that is, one that is characteristic for 
Eumetazoa, 31 for Bilateria, 12 for Protostomia, and two for 
Lophotrochozoa. Twenty-eight of the expected miRNA fam- 
ilies were indeed found, but a total of 18 miRNA loci were not 
represented in the data set (fig. 1). Homologs of 14 bilaterian, 
3 protostomian, and 1 lophotrochozoan miRNA families were 
not recovered. Mir-10, the only conserved miRNA character- 
istic for Eumetazoa, was found having several representatives 
in Gyrodactylus. BLAT-Searches (Kent 2002) of missing 
miRNA families in the genomic reference (crystal-contigs) 
did not recover additional bona fide miRNAs. 

The obtained miRNA complement of G. salaris was com- 
pared with the miRNA complements of the free-living flat- 
worm S. mediterranea (Turbellaria), the fluke Schi. japonicum 
(Trematoda), and the tapeworm £ granulosus (Cestoda). 
A total of 148 miRNAs are deposited in miRBase for S. med- 
iterranea that represent 37 conserved miRNA families and 38 
miRNAs unique to Schmidtea. We included five miRNAs 
without prior assignments to a conserved family, that is, 
the bilaterian-specific mir-153 (sme-mir-2163), mir-184 
(sme-mir-748), mir-190 (sme-mir-756), and mir-210 (sme- 
mir-2164) families, as well as the lophotrochozoan specific 
mir-1989 (sme-mir-2154). For Schi japonicum, 55 loci that 
represent 20 conserved miRNA families and 27 novel 
miRNAs including sja-mir-2162, which is a hitherto unre- 
ported member of the protostomian mir-1993 family were 
retrieved from miRBase. Because of the missing hairpin struc- 
tures, the previously described mir-92 (sja-mir-310) and mir- 
279 miRNAs (sja-mir-61) (Wang et al. 2010) were rejected as 
bona fide miRNAs and thus not included. The miR-1 33 (sja- 
mir-133) miRNA (Wang et al. 2010), also without a bona fide 
hairpin structure, was nevertheless accepted, because of 
recently published expression data and northern blot evi- 
dence (see miRBase entries MI0015298, MI0015290, and 
MI0015293). Only 23 precursors that consisted of 16 con- 
served families and just 3 novel miRNA were recovered 
from miRBase for £ granulosus, including mir-4989 that be- 
longs to the protostomian-specific mir-277 family. A further 
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Fig. 1. The miRNA complement of Gyrodactylus salaris shows lophotrochozoan features, (a) Evolutionary acquisition of miRNA families mapped on a 
simplified tree of Eumetazoa; (b) detailed table for group-specific conserved miRNAs (edited part of table from Wheeler et al. [2009] and results from 
Philippe et al. [2011]). Families are designated parenthetically; family names are bold, (c) miRNAs that are expected in G. salaris based on the affiliation 
to Lophotrochozoans and the actual miRNAs found (black); (d) detailed table for conserved miRNAs found in G. salaris and inferred losses/total 
number of conserved miRNA families. 



six miRNA families have been recently proposed for 
£ granulosus based on a blast-search approach (Jin et al. 
2013) out of which we found four representatives (mir-31, 
mir-133, mir-2162 [1993], and bantam) with properly folding 
hairpins and reasonable offsets for hypothetical mature and 
star reads (data available upon request). Eight of the con- 
served bilaterian, protostomian, and lophotrochozoan 
miRNA families that were not detected in C. salaris were 
also not found for the other flatworm miRNA complements 
(fig. 2); these were mir-33, mir-34, mir-137, mir-193, mir-242, 
mir-252, mir-31 7, and mir-375. In addition, the neodermatan 
Schi. japonicum, £ granulosus, and G. salaris lack the eight 
conserved miRNA families mir-12, mir-29, mir-92, mir-210, 
mir-216, mir-315, mir-750, and mir-1989, which sum up to 
16 shared losses of miRNA families in this clade. A clade 
consisting of Cestoda and Trematoda is supported by the 
shared loss of a further eight miRNA families (mir-76, mir- 
184, mir-278, mir-279, mir-281, mir-1175, mir-1992, and mir- 
2001). Thus, 24 of the 46 expected miRNAs, which include all 
lophotrochozoan-specifk miRNAs, were not found in 
Cestoda and Trematoda. The compiled data set also indicates 
that each flatworm lineage has lost further miRNA families 
independently; that is, mir-2001 in S. mediterranea, mir-22, 
and mir-133 in G. salaris, mir-22, mir-87, mir-96, and mir-153 



in Schi. japonicum, and mir-36, and mir-67 in £ granulosus. For 
each flatworm species, a set of novel miRNAs has been de- 
scribed. Surprisingly, no novel miRNAs were shared between 
flatworm lineages, that is, they were all species/lineage-speci- 
fic. The observed specificities of the novel miRNA include the 
hairpin and particularly the family-and-function determining 
seed sequences. Although nothing can be said about their 
function, they must have different target sequences. 

Multicopy miRNAs 

The miRNA complement of C. salaris includes several multi- 
copy miRNA families. Four hairpins consisted of mature se- 
quences with seeds from the miR-2 family, three miRNAs 
representing the miR-10 family, two copies for each of the 
miR-96, miR-124, miR-184, and miR-277 families. Two 
miRNAs with high similarity to the miR-71 family were 
found in clusters that include other miRNAs with seed se- 
quences known for members of the extended miR-2 family 
(fig. 3). All members of the extended mir-2 family from 
G. salaris were arranged in clusters, that is, gsa-mir-71a clus- 
tered together with gsa-mir-2a;-2b;-2a-2/b-2, and gsa-mir-71b 
with gsa-mir-2a-3 (fig. 3a). Clustered miR-2 family members 
have been described previously from several organisms, 
including flatworms (Marco et al. 2012). However, when 
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Fig. 2. Loss and gain of miRNAs in flatworms. (a) miRNA distribution in flatworms shows pattern of taxa-specific losses of conserved miRNA families 
and sets of novel miRNAs with unique seeds for each group. Red blow-up: presence/absence pattern of taxa-specific groups of conserved miRNAs 
(Eumetazoa, Bilateria, Protostomia, and Lophotrochozoa); black squares represent the presence of a miRNA family (note: predicted miRNAs without 
sufficient experimental evidence for expression, or expressed miRNAs without genomic confirmation are depicted by "?"). Losses are depicted as white 
fields or highlighted in colors (blue: shared losses in flatworms, green: shared losses in Neodermata, yellow: shared losses in Trematoda + Cestoda); 
(b) gain and loss of miRNA families in Eumetazoa and Platyhelminthes, branch lengths corresponds to number of gains/losses. 



comparing the mir-2 data of G. salaris with those of 
S. mediterranea, £ granulosus, and Schi. japonicum, it 
becomes likely that there are at least two extended mir-2 
miRNA clusters that contain a mir-71 each in flatworms 
(fig. 3b). 

Phylogenetic Analysis of miRNA Data 
A presence/absence matrix was compiled for 153 conserved 
miRNA families from 36 metazoan taxa (supplementary 
matrix, Supplementary Material online). Only miRNAs were 
included that were unambiguously identified on the basis of 
well-characterized seed regions. The subsequent maximum 
parsimony analysis with 151 parsimony-informative loci 
yielded 72 equally parsimonious trees each with 306 steps. 
The strict consensus tree (fig. 4) confirmed the monophyly 
of Platyhelminthes with moderate bootstrap and Bremer 
support and their affiliation to Lophotrochozoa. The free- 
living S. mediterranea was basal to the Neodermata. 
Within the Neodermata, the Monogenea (C. salaris) 
were basal to a clade comprising of Trematoda 
(Schi. japonicum) and Cestoda (£ granulosus). Within 
Neodermata, there was also moderate statistical support for 
the respective nodes. 



Discussion 

The miRNA complement of the monogenean flatworm 
G. salaris was comprehensively characterized using the 
newly developed bioinformatics pipeline miRCandRef. 
A major advantage of miRCandRef is that it does not require 
the availability of a reference genome. Instead, it utilizes a local 
assembly approach based on small RNA and genomic DNA 
read-pools. Considering only genomic reads that contain 
a miRNA-candidate sequence, that is, a small RNA read, 
the computational requirements (in particular CPU time) 
for the sequence assembly are significantly reduced. 
MiRCandRef also takes rather small contigs with a length of 
less than 200 bp into account; these are usually not consid- 
ered in de novo genome assemblies. Small modifications to 
miRDeep2 have been implemented that allow detecting 
additional miRNA-hairpins with unusual length (supple- 
mentary information, Supplementary Material online). It is 
likely they had been overlooked in a standard approach. 
Therefore, miRCandRef recovers more miRNAs candidate 
loci from NGS read-pools than other approaches that rely 
on de novo genome assemblies (e.g., with the default 
miRDeep2). 
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Fig. 3. Secondary structure of extended mir-2 family in Gyrodactylus salaris and their genomic organization in flatworms. (a) Secondary structure of the 
two extended mir-2 clusters found in G. salaris (mfold). Red marked sequences represent mature miRNAs, blue marked are star sequences; (b) 
comparison of mir-2 gene-arrangements in flatworms and as outgroup representative Capitella teleta (arrows: black mir71 seed: GAAAGA, white mir-2a 
seed: CACAGC, gray mir-2b/-13 seed AUCACA). *sme-mir-752 evolved by arm-switching from a mir-2a precursor. 



With respect to 46 conserved miRNAs earlier described 
for Metazoa, Bilateria, Protostomia, and Lophotrochozoa 
(Wheeler et al. 2009; Erwin et al. 2011), a significant loss of 
conserved miRNAs was detected in Monogenea. Altogether 
18 of the expected conserved miRNA families were not de- 
tected in G. salaris. When including miRNA complements of 
S. mediterranea (Turbellaria), Schi. japonicum (Trematoda), 
and £ granulosus (Cestoda) into the analyses, it turned out 
that the loss of conserved miRNA loci is common in flat- 
worms. In the free-living S. mediterranea, approximately 
20% of the conserved miRNA families could not be detected, 
whereas in the more derived Schi. japonicum and E. granulosus 
approximately 50% had been lost. Conserved miRNA families 
are not lost randomly, but there is a pattern that is congruent 
with the phylogenetic relationship Turbellaria + Neodermata 
([Monogenea {Trematoda + Cestoda}]). This means that 
losses of conserved miRNA families in platyhelminths are 
phylogenetically informative. It is important to note that 
the miRNA complement of £ granulosus representing 
Cestoda is derived from cloning data and is therefore most 
likely incomplete. Blast approaches have revealed further 
miRNAs (Jin et al. 2013) that were at least considered partly 
based on the available information about their structure. 
These hairpins were considered provisional for this study al- 
though conclusive proof is still pending. However, the extent 



of the inferred losses of conserved miRNAs in the closely 
related trematode Schi. japonicum is very similar, which was 
taken as support for assuming that the majority of miRNAs 
are included in the published complement of £ granulosus. 
Losses of more than 50% of conserved miRNA families as seen 
in endoparasitic platyhelminths have not yet been reported 
from any major lineage within other bilaterian phyla. Given 
the particular importance of miRNAs for the functionality of 
gene expression (Bartel and Chen 2004), it seems rather un- 
likely that about half of the otherwise conserved miRNAs just 
got lost in platyhelminths. Indeed, it has been suggested that 
miRNA losses might relate to loss of targets (Sperling and 
Peterson 2009) or reduced morphological complexity 
(Erwin et al. 2011). For Neodermata, there are some morpho- 
logical reductions, such as, for example, the loss of the gut in 
tapeworms. Recently, Tsai et al. (2013) also reported genomic 
reductions such as, for example, the loss of several homeobox 
gene families, and the absence of most genes essential for 
peroxisomes and fatty acid biosynthesis in flukes and tape- 
worms. It seems therefore likely that not only morphological 
changes but also metabolic changes relate to the loss of 
miRNAs. However, several novel miRNAs were identified 
for each platyhelminth lineage but function could not 
be proposed for these. Some recent studies demonstrated 
differentially expressed conserved miRNAs from various 
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Fig. 4. (a) Strict consensus of 24 equally parsimonious trees of a maximum parsimony analysis of 153 miRNA families in 36 metazoan taxa. The tree was 
rooted with Monosiga, Amphimedon, and Suberites. Numbers in black squares show evolutionary acquisitions of miRNA families; numbers in white 
circles show bootstrap values for 1,000 replicates, gray are Bremer support values. Squares colored in shades of gray depict Eumetazoa (E), Bilateria 
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developmental stages and sexes of platyhelminth worms, 
pointing to functional differences at different life stages 
(Friedlaender et al. 2009; Cai et al. 2011; Cucher et al. 2011). 
However, there is only very little detailed information relating 
to how miRNAs actually target mRNAs in animals. Rather, it is 
likely that miRNAs themselves are also frequently targets of 
other messenger RNAs (Sumazin et al. 2011). The few pub- 
lished studies on miRNA targets in flatworms were all per- 
formed in trematodes using either a bioinformatics approach 
for miRNA prediction from EST data that lack crucial small 
RNA read evidence (Schi. mansoni) (de Souza Gomes et al. 
2011) or predicted miRNAs without a genomic reference 
specific to the respective species prior to target prediction 
(Fasciola [Xu et al. 2012] and Orientobilharzia [Wang et al. 
2012]). Accordingly, these studies are of only limited value for 
understanding the functions of miRNA targets. For the mono- 
genean C. safaris, there is currently no fully assembled and 
annotated genome or EST data set available, which would be 
a prerequisite for a similar bioinformatics target analysis. 

A phylogenetic analyses of the presence/absence of 153 
miRNAs from 36 metazoan taxa recovered platyhelminths as 
monophyletic (fig. 4). In the current analysis, the number of 
loci included in the data matrix was more than doubled as 



compared with the previously published study of Helm et al. 
(2012). The increased matrix may largely explain the im- 
proved resolution of the tree, and the higher support for 
many nodes within (supplementary matrix, Supplementary 
Material online). 

The obtained phylogenetic tree was largely in agreement 
with the current view on metazoan phylogeny, which is lar- 
gely based on phylogenomic data sets as a gold standard 
(Edgecombe et al. 2011). Bilateria and Chordata are both 
monophyletic and well supported. However, there was no 
support for the monophyly of Lophotrochozoa/Spiralia, 
Deuterostomia, and Acoela. This can likely be attributed to 
the lack (Nephrozoa, Acoela, Ecdysozoa, and Trochozoa) or 
the low number (Deuterostomes [1], Lophotrochozoa [2]) of 
apomorphic miRNAs. Surprisingly, no novel miRNA was 
found apomorphic for Platyhelminthes, Neodermata, or the 
Trematoda + Cestoda clade. However, the shared losses of 
miRNAs were phylogentically informative, and reasonable sta- 
tistical support was found for Platyhelminthes, and for the 
Monogenea being basal in the Neodermata with a sister 
group consisting of the endoparasitic Trematoda and 
Cestoda. Although novel miRNAs provided no information 
for the phylogenetic relationships of flatworms, the 
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duplication of the mir-2 cluster, each including a further mir- 
71 hairpin, is likely a synapomorphy of Platyhelminthes. Two 
copies were described previously also for Schi. mansoni (two 
miR-71 ) (de Souza Gomes et al. 201 1 ), G sinensis (two miR-71 ) 
(Xu et al. 2010), and 0. turkestanicum (two mir-71) (Wang 
et al. 2012). In E. granulosus and £ multilocularis, a second mir- 
71 that is associated with a mir-2 is missing, even though a 
second mir-2 locus has been described (Cucher et al. 2011). 
This implies a secondary loss of a second mir-71 copy in 
cestodes. In contrast, S. mediterranea showed altogether 5 
extended mir-2 loci, of which 4 included also an additional 
mir-71. This indicates a secondary acquisition of further mir- 
71 12 clusters in Turbellaria. Despite all limitations of the 
miRNA data currently available for platyhelminth species, 
that is, limited taxonomic coverage, incorrectly annotated 
miRNA loci in miRBase and/or incomplete data, there is no 
question about the significant loss of otherwise conserved 
miRNAs. Interestingly, the observed numbers of lost miRNA 
families in flatworms is lowest in the free-living S. mediterra- 
nea, somewhat higher in the ectoparasitic G. salaris and high- 
est in the endoparasitic Schi. japonicum and E. granulosus. 
Currently, there is however no convincing evidence that 
would tie loss of miRNAs to parasitic lifestyle. Flatworms, 
and in particular the parasitic Neodermata, therefore offer 
an excellent model for addressing functional aspects of 
miRNA evolution. The pipeline miRCandRef offers a straight- 
forward analysis of miRNAs from nonmodel organisms by 
exploiting genomic information on the level of NGS reads 
and user-optimized usability at the hyperbrowser (http:// 
hyperbrowser.uio.no, last accessed September 26, 2013). 

Materials and Methods 

Next-Generation Sequencing of G. salaris Short RNA 
and Total Genomic DNA 

The G. salaris (Lier strain) parasites were cultured and har- 
vested as described before (Fromm et al. 2011). A total RNA 
sample enriched for small RNA was extracted from 100 
pooled individuals using the ZR RNA MicroPrep kit (Zymo 
Research). A small RNA library was prepared using the 
ScriptMiner Small RNA-Seq Multiplex Library Preparation 
Kit (Epicentre) and subsequently sequenced on an lllumina 
GA2 in a 55 bp, single-end run (one lane). A total genomic 
DNA extraction of approximately 20,000 pooled individuals 
was performed using the E.Z.N.A-Tissue Kit (Omega Bio-tek). 
The genomic DNA was subsequently used for library prepa- 
ration and subjected to a 76 bp, paired-end run on an 
lllumina GA2 instrument (one lane). 

Bioinformatics 

The algorithm miRCandRef was developed for assembling 
high-quality filtered NGS data into relatively short genomic 
contigs (crystal-contigs) that match predicted miRNA candi- 
date loci. The pipeline includes a modified version of 
miRDeep2 (Friedlander et al. 2012) and delivers a multi- 
fasta file of assembled and clustered miRNA crystal-contigs 
that can be used as a reference for miRNA prediction software 
like miRDeep2 rather than a properly assembled genome (for 



details see supplementary information, Supplementary 
Material online). To distinguish between missing and not ex- 
pressed miRNAs, the crystal-contigs were additionally 
screened for absent miRNA families. Known miRNAs in 
G. salaris were identified using a reference file that contained 
all miRNAs from S. mediterranea, Schi. japonicum and 
£ granulosus retrievable from miRBase. All miRNAs were 
compared with a database of 46 miRNAs that represent pro- 
posed ancestral miRNA complements of Eumetazoa, Bilateria, 
Protostomia, and Lophotrochozoa. Sequence similarity in the 
seed sequence of the mature miRNA and other regions of the 
hairpin were the key criteria for miRNA family assignment. 
miRNAs from the flatworms S. mediterranea, Schi. japonicum, 
and £ granulosus were retrieved from miRBase and analyzed 
accordingly. 

Phylogenetic Analysis 

The character matrix for 153 miRNAs from 36 taxa was an- 
alyzed using PAUP* v.4.0b10 (Swofford 2003). All characters 
were encoded for presence or absence and assigned equal 
weight. The analysis was performed with unordered charac- 
ters and a heuristic search implemented under a Dollo model. 
Hereby, each character is allowed to evolve only once and all 
homoplasy is explained by reversals from the ancestral state. 
Node support in the strict consensus tree of equally parsimo- 
nious trees was estimated by running 1,000 bootstrap repli- 
cates. Bremer support values were generated including all 
trees with up to six more steps (Bremer 1994). 

Supplementary Material 

Supplementary file S1, figs. S1 and S2, information, and matrix 
are available at Molecular Biology and Evolution online (http:// 
www.mbe.oxfordjournals.org/). 
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